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We introduce a family of new centralities, the fc-spectral centralities. fc-Spectral centrality is a 
measurement of importance witli respect to the deformation of the graph Laplacian associated with 
the graph. Due to this connection, fc-spectral centralities have various interpretations in terms of 
spectrally determined information. 

We explore this centrality in the context of several examples. While for sparse unweighted net- 
works 1-spectral centrality behaves similarly to other standard centralities, for dense weighted net- 
works they show different properties. In summary, the fc-spectral centralities provide a novel and 
useful measurement of relevance (for single network elements as well as whole subnetworks) distinct 
from other known measures. 

PACS numbers: 



I. INTRODUCTION 



Over the last two decades, techniques of network anal- 
ysis have come to play an important role in the repre- 
sentation and understanding of the structure of complex 
systems. In particular, measures of importance and cen- 
trality have allowed to quantify aspects of networks which 
are fundamental to their structural or dynamical proper- 
ties. For example, highly central nodes in social networks 
can indicate actors with the most influence or, from a 
different point of view, those actors through which in- 
formation or disease must pass to reach the full graph. 
Different measures are crafted using different mechanisms 
and assumptions involved in measuring importance. Be- 
tweenness centrality [TT] and related measures 02] as- 
sign importance to nodes through which many shortest 
(geodesic) paths pass, while Random Walk Betweenness 
centrality [3T] considers all possible paths. Degree cen- 
trality simply assigns importance to those nodes with the 
highest number of edges. Eigenvector centrality 0, on 
the other hand, measures a type of relative importance 
- a node is important if it is neighbors with other im- 
portant nodes (see [22] for a review and discussion of 
many centrality measures). While these measures are of- 
ten (roughly) commensurate when analyzing sparse sym- 
metric graphs, they can diverge dramatically when con- 
sidering dense symmetric graphs. The basic reason is 
straightforward - dense graphs have many short paths as 
most nodes are connected to one another. Thus, mea- 
sures such as betweenness and degree centrality can be- 
come less meaningful - if all nodes have roughly the same 
degree and almost every node is connected to every other 
node, these measures carry very little information. In 
this paper, we introduce a new family of measures of 
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centrality, fc-spectral centralities, which can be more ef- 
fective for dense graphs. 

fc-Spectral centralities are measures of importance rel- 
ative to properties of the graph Laplacian, defined as 
L = D — A, in which A is the (symmetric) adjacency ma- 
trix and D a diagonal matrix with node connectivities as 
its terms: Da = di = aij. There are many interpre- 
tations of the information encoded in the graph Lapla- 
cian (and hence measured by the spectral centralities). 
First, the graph Laplacian is known to encode various 
aspects of the geometry and dynamics of the graph. For 
example, the number of zero eigenvalues gives the num- 
ber of connected components, the size of the first non- 
trivial eigenvalue - the Fiedler value - encodes aspects 
of the graph's connectivity [31[H1[TD]. Alternatively, the 
Fiedler value encodes the rate at which a network can 
synchronize |28| . Second, the Laplacian operator's ac- 
tion on function supported on the graph determines the 
graph theoretic analogue of heat flow (see [T]). 

The spectral decomposition of the Laplacian therefore 
encodes the various modes of the heat flow. So, for net- 
works where such mechanism are appropriate (e.g. in 
models of the evolution of opinion in social networks) 
these modes identify relevant substructures of the graph. 
Third, the graph Laplacian plays a key role in the graph 
segmentation - into two pieces, via the relaxed Ratio- 
Cut problem [M] , or into multiple clusters using spectral 
clustering [25] . 

With these points of view in mind, we define fc-spectral 
centrality of a subset, B, of a connected graph (e.g. a 
single edge or node, or a whole subnetwork) by measuring 
the subset's relevance in terms of eigenvalues of the graph 
Laplacian. Let A be the adjacency matrix of the graph 
and L its associated graph Laplacian. We define the k- 
spectral centrality of B to be |A'^(0)| where Afe(t) is the 
fc*'' smallest nontrivial eigenvalue of the graph Laplacian 
of AB{t), the adjacency matrix for the network under a 
perturbation of B. While the definition applies to generic 
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subsets, we focus primarily on two special cases - single 
edges and single nodes (i.e. a node with all of its edges). 

fc-spectral centrality measures the extent to which a 
subset - e.g. a node or edge - contributes to the infor- 
mation encoded in the k^^ eigenvalue/eigenvector pair. 
For example, 1-spectral centrality encodes information 
on network connectivity. So, perturbations of subsets 
with high 1-spectral centrality have the most impact on 
connectivity. While this is similar in some respects to 
other centralities, for example betweenness centrality, in 
many cases 1-spectral centrality is a distinct measure. 
Similarly, higher spectral centralities provide measures of 
higher order connectivity as demonstrated by their role in 
efficient decomposition of graphs via spectral clustering. 

The computation of 1-spectral centrality for a network 
of N of nodes is on the order of 0{N^). This is com- 
parable to computational cost for other centralities such 
as random walk or eigenvector centrality, but we remark 
that many optimized algorithms (e.g. for betweenness 
centrality) consider usually a sparse graph (with number 
of links of the same order of magnitude of the number of 
ondes) thus they could be considerably slower on dense 
graphs. We examine 1-spectral centrality and other stan- 
dard centralities for several sparse networks: a toy model, 
the Zachary karate club network [3T] and a social net- 
work of dolphins JTS]. We then move to larger, dense 
networks: a network formed from roll call vote data for 
the 111'^ U.S. House of Representatives, a correlation 
network of equities prices from the S&P 500 and a bio- 
logical weighted network obtained from the Immune Sys- 
tem [30 . Generally, we see that for the small, sparse 
networks, 1-spectral centrality behaves similarly to ex- 
isting centralities while for the denser examples, there 
are significant differences. Higher spectral centralities 
(A: > 1) behave differently than existing centralities in all 
examples. 

We also discuss the spectral centralities in the context 
of applications to opinion propagation and consensus for- 
mation [71 [T71 [Ml [2S] where they have direct interpreta- 
tion. 



II. SPECTRAL GRAPH THEORY 

For this discussion, we assume that A is symmetric, 
connected, and without self loops, an — (weighted 
undirected graphs). As A is symmetric, L is symmet- 
ric and diagonalizable with eigendata given by (Ai, Wi)"=o 
with Ao < Ai < A2 < • • • < A„. We observe that Ao = 
as LI — 0. As we assume the network is connected, we 
know that Ai > [31[ni[Tn]. The first nontrivial eigen- 
value/eigenvector pair, known as the Fiedler value and 
vector, have been extensively studied (see, for example, 
PITU] for the initial work). 

The graph Laplacian is directly related to the so-called 
Ratio-Cut problem. For a graph with vertex set V and 
edge set E, a solution to the ratio-cut problem is a collec- 
tion of two subsets of nodes U and W so that V = UUW 



and 



RCut{U, W) 



#{e G E\3u eU,w eW with e = (u, w)} 
#U ■ #W} 



is minimized. This variant of the standard min-cut prob- 
lem favors solutions in which the sizes of U and W 
are comparable. Solving the ratio cut problem is NP- 
hard, but one can solve a relaxed version of the prob- 
lem via an analysis of the graph Laplacian [T3|. More 
precisely, if vi is the Fiedler vector associated with the 
graph Laplacian, then sign{vi) gives an indicator func- 
tion which provides an approximation of the solution - 
i.e. U = {i\vi{i) < 0},F = {i\viii) > 0}. The first 
I eigenvectors associated with nonzero eigenvalues pro- 
vide a dimension reduction of the data. This dimension 
reduction, the Z-dimensional spectral embedding^ simply 
uses the I vectors to give coordinates of the nodes in an 
I dimensional space. In our examples, many of the vi- 
sualizations are created using the 2-dimensional spectral 
embedding. The spectral embedding is the basis for the 
technique of spectral clustering where a Euclidean 
clustering method such as k-means is used to cluster the 
nodes given these coordinates. 



Spectral Centrality 

With the setup of the previous section, we define spec- 
tral centralities in terms of the effect of a deformation on 
the size of the eigenvalues. 

Definition 1 For an adjacency matrix A of a graph Q 
and a deformation of a subset B given by A/^ : {—S, d) — > 
A4 , a differentiable map of A in the space of symmetric 
matrices, the k-spectral centrality associated to the defor- 
mation is given by 

4 = 14(0)1 

where Afc(e) is a k*^ nontrivial eigenvalue of the graph 
Laplacian of Ag (e) . 

Our primary example is 1-spectral centrality. As the 
Fiedler value, Ai defines the so-called algebraic connec- 
tivity, we provide bounds for the vertex and edge con- 
nectivities of the graph ([9] flOl). Thus, we see from the 
definition that 1-spectral centrality is a measure of how 
much the deformation changes the algebraic connectivity 
(and hence the standard connectivity) of the network. 
As Ai also measures the speed of convergence of the syn- 
chronization dynamics on a network [28' , 1-spectral cen- 
trality can also be interpreted in terms of this. Other 
fc-centralities have different interpretations as well. For 
our purposes, we will mainly focus on 1-spectral central- 
ity and, in discussing higher spectral centralities, we will 
do so in the context of the role of the {Xk,Vk) in algo- 
rithms such as spectral clustering. We note that spectral 
perturbations arise in many applications and that the 
general idea of measuring importance via perturbation is 
similar to the to S- centrality framework of [16j. 
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Theorem 1 Let Aq be the adjacency matrix of a con- 
nected undirected network with k nodes. Suppose A : 
(—(5, (5) — > be a dijjerentiable path in the space of sym- 
metric matrices with A{0) — Aq giving a deformation of 
the matrix Aq. Let L(e) be the graph Laplacian asso- 
ciated with A{e) for each e G (—(5,(5). Let (Ai(e), Wi(e)) 
be the eigenvalue/unit eigenvector pairs for L(e) and as- 
sume that Afe(O) has multiplicity one. For S sufficiently 
small, Xk ■ {—5,6) R and Vk : (-(5,(5) S''^^, the 
eigenvalues and vectors associated with L{e), are differ- 
entiable functions. Then, 

AUO) = ^Q, 

where, C = FL' {Q)F , and F is the diagonal matrix with 
entries given by the entries of the eigenvector, Ufe(O). 

Proof: The existence of (5 > so that Afc(e) an(i Vk{e) 
are smooth follows from an application of the Implicit 
Function Theorem to the defining equation of eigenval- 
ues and the defining equation of eigenvectors respectively. 
Since we assume that network is connected, we know that 
Aa;(0) > for fc > 0. This, together with the multiplic- 
ity assumption implies that, shrinking 5 if necessary, we 
may conclude that (Afc(e), Wfc(e)) remain the k^^ nontriv- 
ial eigenvalue/eigenvector pair for e G (—(5,(5). 
As Afe(e) is an eigenvalue of L(e), we have 

det(i(e) - Afc(e)/) 0. 

Differentiating with respect to e using Jacobi's formula 
PP] yields: 

0= ^dct(L(e)-Afc(e)/) 

= tr{adme) - \k{e)I) ■ {L'{e) - A',(e)/)) 

where adj{M) is the adjugate of the matrix M. L{e) — 
\k{t)I is diagonalizable and, as we assume the multi- 
plicity of the fc"^ eigenvector of £(0) is one, has a single 
zero eigenvalue with associated eigenvector Wfc(e). Let- 
ting V be the matrix of orthonormal eigenvectors and 
{0 = /ii, /i2, • . . , /^n} be the eigenvalues, we have 

adj{L{e) - Afc(e)/) = det{VV')Vadj{M)V' 

where M is the diagonal matrix of /i^. As V'V = /, 
we have det{VV') = det{V'V) = 1. The adjugate of 
the diagonal matrix M is a diagonal matrix A where 
An = Ylj-iil^j- In particular, as fii = 0, only An is 
nonzero. We denote by A(e). So, in summary we 
have, 

adj{L{e) - \k{e)I) = A{e)vk{e)vk{ey . 
Continuing our computation, 

= tr{adjiL{e) - Afe(e)/) • (i'(e) - A',(e)/)) 
= A{e)tr{vk{eH{enL'{e)~X',{e)I)) 
= A{e)(tr{vk{e)vk{eYL'{e)) - X'^{e)tr{vk{e)vk{eY)) 
^ Amriv,{e)v,{eYL'ie))) - A{e)X[{e) 



The ir(wfc(e)ufc(e)*)) = 1 since ffc(e) is taken to have unit 
length for all e. Solving for A'^(e) yields 

X',ie)^triv,ie)v,{erL'ie)) 
= Y,u,{e)L'{e)v,{e) 

ij 

The resuh follows by letting C = F(e)L'(e)F(e) and eval- 
uating at e = 0. □ 

We make two comments. First, if an eigenvector has 
multiplicity greater than one, the deformation may still 
be constructed and analyzed. However, since there are 
potentially multiple independent vectors in the kernel of 
L(e) — Afc(e)/, the calculation breaks down. Second, as 
eigenvectors are not unique, the computation is not a 
priori well posed. However, as we see from the Theorem, 
|A'j,(0)| is independent of the choice of unit eigenvector. 

Next, we apply this to specific deformations using the 
graph Laplacian L — D — A. Computations using other 
definitions of the Laplacian and other deformations are 
easily computed from the previous theorem. 

Edge centrality 

For an edge between node i and node j, we consider 
the matrix where 

[0 otherwise 

Let L be the graph Laplacian of B^'^'^\ Then, observe 
that L-\-eL is the graph Laplacian of the original network 
under a deformation of the edge between node i and node 
j. Let {Xi{e) , Vi{e))2^Q be the eigendata associated with 
this deformed Laplacian. 

Corollary 1 Let A be an adjacency matrix which repre- 
sents a connected, undirected, weighted network. For the 
edge deformation B^^'^\ 

where Ufc(O) — (i^i(O), . . . , i'„(0)) is the k*^ eigenvector of 
the graph Laplacian associated with A. 

Considering the 1-spectral centrality, this result re- 
flects the intuition given by the link between this graph 
Laplacian and the ratio-cut problem. Edges have high 
1-spectral centrality if they connect two nodes which are 
very far apart when measured by the Fielder vector val- 
ues. These are precisely the edges of highest importance 
to the relaxed ratio-cut problem. 

Node centrality 

In a similar manner, we define the node spectral cen- 
trality by first defining a deformation. For node i, we 
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consider the matrix i?^*^ where 



B 



ii) 
kl 



Am if (fc,0 G 
otherwise 



Notice that we have picked the deformation shghtly dif- 
ferently, adding the weights of edges. Letting L be the 
graph Laplacian of B'^^\ L + eL is again the graph Lapla- 
cian of the original network under a deformation of all of 
the edges associated with node i proportionally to their 
weight. Note that L has a more complicated form that 
in the case of a single edge: 



/ an 



L = 











-an 







\ 



-an —ai2 



1 ^2,2+1 



V 



— a„ 



... ain ) 

(1) 

With this notation in place we have the following result. 



Corollary 2 Let A he an adjacency matrix which repre- 
sents a connected, undirected, weighted network. For the 
node deformation B^'^\ 



(2) 



where Ufe(O) = (i'i(O), . . . , i^„(0)) is the k*^ eigenvector 
of the graph Laplacian associated with A. 

Similarly to edge 1-spectral centrality, this makes sense 
in the context of the relaxed ratio cut problem - a node 
is as important as the weighted sum of its edges. 



Spectral centrality for general subsets 



which, in that terminology, follows walk trajectories - 
paths that are traced on the network can loop - and 
spreads via a parallel process - nodes spread simultane- 
ously to all neighbors rather than just one. To make this 
precise, we describe the flow process induced by the graph 
Laplacian in terms of information propagation, where we 
view the information on the network as a function sup- 
ported on the nodes. The values of this function then 
change over time according to the flow dynamics. In the 
case of dynamics governed by the graph Laplacian, the 
function value at the node is replaced by a value propor- 
tional to the weighted average of the function values over 
all the node's neighbors: 

xt+i ^ xt - Lxt = xt - Dxt + Axt 

soxt+i(i) = xt{i)~J2j ^'.ji^t{i)-xt{j)). This is the dis- 
crete analogue of heat flow instantiated on the network. 

As such, of the types of traffic - used goods, money, gos- 
sip, e-mail, attitudes, infection, and packages - described 
in [3], the spread of attitudes is modeled by this case. 
However, the diffusion mechanism is different than par- 
allel duplication (used in, for example, models of gossip 
propagation)- instead of a node exerting influence over its 
neighbors by duplication of its attitude, there is merely 
a nudging of the attitudes of neighbors which may be 
either tempered or enhanced by the attitudes of other 
neighbors. Thus, we see application of a notion of cen- 
trality based on the graph Laplacian to situations where 
this type of information diffusion is appropriate. Two of 
our main examples below - the network of roll call votes 
in the House of Representatives and the correlation net- 
work of the S&P 500 network - have substantial aspects of 
this type of information diffusion. Indeed, in both cases 
the behavior of the nodes is, at least in part, influenced 
by the behavior of the nodes they are connected to via 
measures of similarity. 



We can also do this with any subgraph - simply pick 
the deformation matrix, B, to be the adjacency matrix 
of the subgraph in question. As before, we let L be the 
graph Laplacian associated to B and we flnd a similar re- 
sult. Specifically, if A be an adjacency matrix which rep- 
resents a connected, undirected, weighted network. For 
the general deformation B, 

s% = Y,B.,,{vM-^A^)f 

where Wfc(O) = (i^i(O), . . . , t'„(0)) is the /c**^ eigenvector of 
the graph Laplacian associated with A. 

Relation to other measures of centrality 

In [3], Borgatti introduced a typology of centrality 
measures based on characterizations of their flow pro- 
cesses. The graph Laplacian induces a flow process 



Opinion dynamics 

As mentioned in the introduction, spectral centralities 
have a direct connection to the models of De Groot [7] 
and Lehrer '17] (see also [211 ES] for a broader frame- 
work of consensus in multi-agent systems with similar 
dynamics). Those models posit a social network encoded 
in an adjacency matrix A where the matrix is normal- 
ized so that the degree of each node is one. In that case, 
L ~ L — A. If we let Xi{t) be the measure of opinion of 
node i at time t, the model then updates the opinions by 
the weighted sum of neighboring opinions: 

xt+i = Axt 

= {I-L)xt 
^ Xt — Lxf 

Thus, under the further restriction that A has been 
normalized to unit degree, we have that the heat flow 
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modeled by the Laplacian is precisely this model of opin- 
ion dynamics. 

We note that some investigations using similar spectral 
perturbation methods in this direction have already been 
completed to maximize connectivity [12j or synchroniza- 
tion dynamics O [13] . With the assumptions above and 
our terminology, these are investigations related to the 1- 
spectral centrality. Interested readers should also see [27] 
for related work when considering social networks which 
incorporate interaction dynamics. 

III. RESULTS 

In this section, we discuss several different applica- 
tions of the node 1-spectral centrality of real networks 
and compare them to other centrality statistics - degree, 
betweenness centrality, random walk centrality, informa- 
tion, and eigenvector centrality. We begin with three 
examples of small unweighted, undirected networks and 
move to more complicated networks. The examples con- 
vey a general observation - that for small, undirected and 
unweighted networks which are not particularly dense, 
1-spectral centrality performs often similarly to exist- 
ing centrality measures. We next examine three dense 
weighted networks - a network associated to roll call 
votes in the U.S. House of Representatives, an equities 
network, and a biological network derived from the func- 
tion of the Immune System. These analyses provide ev- 
idence that for complicated networks, particularly dense 
weighted networks, 1-spectral centrality is substantially 
different than other measures. 

Higher spectral centralities {k > 1) behave completely 
differently from existing measures. The equities network 
case provides a good example of the environment where 
the fc-spectral centralities can be most useful - a dense, 
complex network with structure determined at multiple 
scales by (potentially) multiple processes. 

In the first two dense examples, the network edges are 
given by a similarity measure and, in each case, this re- 
sults in all-to-all connections. The last dense example is 
also almost a complete graph, with connections only dis- 
tinguished by their weights. Thus, we focus on weighted 
versions of the centrality measures which then allow sen- 
sible comparison to spectral centrality. 

As we shall see in the analysis, these examples pro- 
vide different glimpses into the varying interaction and 
relationship between the different measures. We argue 
that spectral centralities provide either a better way of 
understanding the data or provide a new window of in- 
vestigation. 

Unweighted sparsely connected networks 

To place 1-spectral centrality in the constellation of 
centrality measures, we will compute it as well as other 
measures on three simple networks. The first is a toy 



model - a hand made network that possesses features 
which reflect different aspects of notions of centrality. 
The other two examples are well known smaller networks 
which have been substantially studied in the literature - 
the Zachary karate club |31j and a social network of dol- 
phins p] . 



Toy model 

The toy model consists of 13 nodes. There is one "cen- 
tral node" - a pinch point which gives the only connec- 
tions between two halves of the network. In each half, 
there are six nodes with differing connectivity. Figure [l] 
(top left) shows the structure of the network as well as a 
comparison of betweenness centrality and 1-spectral cen- 
trality. We see visually that the two measures are qual- 
itatively the same. This is reflected in Table |lj which 
shows that all of the measures are fairly highly corre- 
lated. But, there is a distinction - 1-spectral centrality, 
betweenness and random walk centrality are very similar. 
The remaining three (degree, eigenvector and informa- 
tion centralities) are also very similar but the two groups 
have weaker cross-correlation. This can again be seen in 
the figure by observing the degree which, for example, 
is small for the most "between" node in the center but 
higher for others. 



Zachary karate club 

The Zachary karate club is a social network of 34 
nodes, individuals in the karate club, with edges given 
by social ties. The adjacency matrix for this network 
is unweighted and undirected. Computing degree, be- 
tweenness centrality, eigenvector centrality, random walk 
centrality, information centrality and 1-spectral central- 
ity, we see that (a general fact that has been previously 
observed) degree is closely related to all centralities, but 
that the centralities are differently related to one another. 
The results are summarized in Table [l| which shows the 
correlations between the various computations. Figure 
[T] (top right) illustrates the difference between between- 
ness centrality and 1-spectral centrality. Note that both 
measures give roughly the same results, but with differ- 
ent emphasis. This is illustrated by comparing the two 
for nodes 1 and 34. Node 1 has the highest centrality 
in both measures while node 34 has very high between- 
ness but only moderately high 1-spectral centrality. This 
is reflective of the differences in the ideas behind the 
two measures. Node 1 clearly bridges two parts of the 
network, either from the perspective of finding shortest 
paths or from the perspective of disconnecting the net- 
work (or making it more cohesively connected). On the 
other hand, node 34 plays a strong role in the configura- 
tion of shortest paths but its removal would not harm the 
connectivity of the graph as much as some other nodes. 
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(a) Toy network 




(b)Karate Club (c)Spectral v. 

Betweenness 
Centrality 



(d)Spectral v. 
Eigenvector 
centrality 



FIG. 1: Comparison of 1-spectral centrality and be- 
tweenness centrality in the Toy Network, the Zachary 
Karate club, and the dolphin social network. In (a) 

and (b), size of the nodes is proportional to the 1-spectral 
centrality while color (red is high, blue is low) is reflective 
of the betweenness centrality. In (c) and (d), color indicates 
1-spectral centrality (red is high, blue is low). In (c), size is 
proportional to betweenness and in (d) to eigenvector central- 
ity. 



Dolphin Network 

This data set consists of 64 nodes (dolphins) and edges 
between them representing a measure of social ties - in 
this case, affinity is measured via sustained proximity. 
Like the Zachary Karate Club network, the dolphin net- 
work is undirected and unweighted. The correlations of 
the various centrality measures are again summarized in 
Table [T] These computations have some similarities with 
those for the karate club - degree is highly correlated 
with betweenness, eigenvector, information and random 
walk centralities. However, the correlations are gener- 
ally not quite as strong and 1-spectral centrality is more 
weakly correlated with degree. Moreover, there is a small 
negative correlation between 1-spectral and eigenvector 
centrality. Thus, these two networks show some of the 
subtleties in the relationships between the various mea- 
sures. 

To give a sense of these relationships, in Figure [T] c) 
and d) we show two views of the network. In both views, 
color represents spectral centrality with red indicating 
high and blue indicating low spectral centralities. In c), 
the size of nodes is proportional to betweenness centrality 
while in d), size is proportional to eigenvector centrality. 
We see that as betweenness and 1-spectral centralities 
are highly correlated, the give qualitatively similar re- 
sults. In particular, two dolphins - Beescratch and SNIOO 
- have the highest scores in both measures. In contrast. 



the eigenvector centrality is quite different from the 1- 
spectral centrality. 




FIG. 2: Comparison of the centrality statistics for two 
dense, weighted networks. Left: 111th U.S. House roll call 
network Right: S&P 500 equities network. In each case, the x- 
axis is ordered by the Fiedler vector values of the associated 
graph Laplacian. Color indicates the centrality used: red 
— degree, green = betweenness, blue = eigenvector, cyan = 
information, magenta = random walk, black = spectral. 



Roll call votes in the 111' U.S. House of Representatives 

For this example, we use roll call data collected by J. 
Lewis and K. Poole, available at [18 . The data consist 
of all roll call votes for the 111'^ U.S. House of Repre- 
sentatives. For each member of the House, each of their 
votes is recorded as a yes (1), no (—1), or abstain/not 
present (0). We remove any legislator who did not vote 
in at least a third of the time. Then, omitting the miss- 
ing votes, we compute the percent of votes that pairs of 
legislators have in common. The resulting matrix, P, has 
values between and 1 and encodes, as a weighted undi- 
rected graph, the aggregate voting profiles. We note that 
this graph is dense - most legislators vote on most bills, 
so all legislators have some relation to one another. To 
create a weighted adjacency matrix, we use a scaling of 
the data given by 

{_ ii-Pijf 
e bri^j 
for i = j 

where a = 0.25. This value of a was selected to focus the 
analysis on the relevant aspects of the data. It represents 
a scaling of the raw percentages we initially computed by 
accounting for the fact that almost every pair of represen- 
tatives has at least half of their votes in common due to 
a host of noncontroversial bills. To ensure no self loops, 
we remove the diagonal entries. 

Table |TT] shows the correlations between the various 
centralities. In contrast to the previous cases, we see a 
marked difference between most traditional centralities 
and 1-spectral centrality. Degree, eigenvector, and infor- 
mation centrality are all highly correlated to one another 
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and are negatively correlated with 1-spectral centrality. 
Betweenness centrality is different from all others, ex- 
hibiting only a weak negative correlation with other cen- 
tralities. Random walk centrality shows almost a perfect 
correlation with 1-spectral centrality. Figure [s] (a) shows 
a graphical comparison of degree, betweenness, and 1- 
spectral centrality. 

This example begins to show one of the positive aspects 
of 1-spectral centrality. Analyses in political science (see 
[BmilES]) have shown that recent U.S. Legislative bodies 
are basically unidimensional with respect to the structure 
encoded in the roll call votes when analyzed with spatial 
models. This structure corresponds to our intuitive iden- 
tification by party and ideology. In that sense, members 
of the House that would be most "central" are ones whose 
ideological preferences lie somewhere between the median 
preferences of the two major parties. From Figure [3| we 
see that the traditional centrality measures are not overly 
informative from this point of view - they do not reflect 
this intuitive notion, instead assigning higher centrality 
scores to more peripheral members (the most peripheral, 
all the way to the left, is Rep. Ron Paul (R-TX)). 1- 
spectral centrality, on the other hand, better reflects the 
intuitive notion of centrality in terms of ideology. 

In this example, the 2-spectral centrality does not con- 
tain much useful information. The 2-centrality score for 
one node is very high (roughly 35) while the rest are 
between zero and one. This is possibly a reflection of 
the basic unidimensionality of the data at this scale - 
the Fiedler data captures the majority of the informa- 
tion, leaving little to be encoded in the rest of the eigen- 
data. Another indication of the lack of information in 
the higher spectral centralities is the substantial simi- 
larity between the 1-spectral centrality and the random 
walk centrality - both of these measure aspects of mov- 
ing randomly throughout the network, and their simi- 
larity indicates that higher order spectral data does not 
contribute significantly to the diffusion process. 

Equities network 

In this example, we begin with raw data given daily 
close prices over a three year period (7/2003-12/2006) of 
the equities in the S&P 500 index. We perform some 
preprocessing - converting the prices to percent changes 
from day to day and detrending the results. We then take 
the pairwise correlations, C^ , between these time series 
and form a similarity matrix by converting to chordal 
distance, Sij = sin(arccos(Cij)/2). We construct a scaled 
adjacency matrix: 

f -''^i 

Je ^ fori^j 
[ for i = j 

where cr — 0.25. This value of a was selected similarly to 
the example of the roll call network. Even after cleaning, 
the data still had substantial average correlation. This 
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FIG. 3: 1-spectral centralities in example dense 
weighted networks. Top: A comparison of degree, be- 
tweenness and spectral centralities of the roll call network 
of the 111*'' U.S. House of representatives. Color denotes 
party affiliation (red=Republican, blue=Democrat) while size 
is proportional to the relevant statistic. Nodes are ordered by 
the Fiedler vector. Bottom: The S&P 500 network given by 
its two dimensional spectral embedding where node size is 
proportional to spectral centrality. Coloring is given by an 
8-cluster spectral clustering using two eigenvectors, the edges 
are the 30 edges with the highest spectral centrality. 



value of a helps to promote signiflcant correlation and 
devalue average correlation. 

Table |TT] shows the correlations between the various 
centralities: betweenness centrality is again only weakly 
associated with other centralities, but degree, eigenvec- 
tor, random walk, information and 1-spectral centralities 
are all highly correlated. 

In contrast to the roll call network, spectral central- 
ity in the equities network is best interpreted in terms of 
its definition - what are the nodes which most determine 
the shape of the network? In this case, we see two groups 
with higher than average centrality which correspond to 
two sectors - the Basic Materials sector and the Technol- 
ogy sector. Roughly, the same groups are identified by 
the centralities correlated with 1-spectral centrality (14 
nodes in common in the top-ranking 20 for spectral cen- 
trality and degree). The 12 remaining nodes come from 
all positions on the list. Thus, 1-spectral centrality is de- 
tecting nodes with fairly low scores in other centralities. 
This reflects the fact that such nodes may still have a 
substantial contribution to the shape of the network de- 
spite not scoring particularly highly on other measures. 
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One way to view this is that these nodes have connec- 
tions which are significant to the solution to the relaxed 
ratio-cut problem. Figure [3] illustrates this idea. In that 
picture, nodes are placed according to the two dimen- 
sional spectral embedding with sizes proportional to their 

1- spectral centrality. The included edges are the thirty 
edges with the highest spectral edge centrality. One can 
see from this illustration how 1-spectral centrality reveals 
structure. The nodes in the lower right corner, a cluster 
of Technology equities, are the most central. If one com- 
putes the relaxed ratio cut, it is essentially this cluster 
which is separated from the rest. Evidence of this is en- 
coded in the edges - most of these thirty most central 
edges connect one of these nodes to nodes in other ar- 
eas of the network. However, some connect other pieces, 
showing higher order structure. 

Higher order spectral centralities carry interesting and 
useful information. Figure |4] shows the same spectral 
embedding of the network as shown in Figure |3] with 
node sizes proportional to the 1-spectral centrality (left), 

2- spectral centrality (middle), and 3-spectral centrality 
(right). The colors of the nodes in this figure are also in- 
dicative of the A:-spectral centralities with red indicating 
higher values and blue indicating lower ones. These three 
centralities show three distinct clusters of high centrality 
nodes. As discussed above, the cluster in the lower right 
hand side shows the Technology sector, which have high 
1-spectral centrality values. The cluster in the lower left 
hand corner, with high 2-spectral centrality, are members 
of the Basic Materials sector. The cluster emerging from 
the middle when scaled by the 3-spectral centrality is a 
group of equities from multiple sectors, with the majority 
being from the Healthcare sector. 

To better understand how the spectral centralities en- 
code structure, we consider a sequence of windows of the 
S&P 500 data. We compute the matrix A described 
above for every 450 day window of the data and com- 
pute the centralities for each window. This allows us to 
see how the centrality of various nodes changes over time. 
Figure [5] shows the results for 1, 2, and 3-spectral central- 
ity (top row), degree (bottom left), betweenness (bottom 
middle), and eigenvector centralities (bottom right). In 
all cases, the nodes, as labeled by the y-axis, are ordered 
by sector identification while the x-axis shows the win- 
dow index. For each case, the centralities are normalized 
to take values in [0, 1] so as to make the computations 
comparable. 

We see that in the case of 1-spectral centrality, there 
is a clear transition between one group, which is high 
for roughly the first 150 windows, and another which is 
high for the rest of the windows. The majority of eq- 
uities in the first group are members of the Technology 
sector while the second group is dominated by the Basic 
Materials sector. 1-Spectral centrality shows an aspect 
of the effect of the business cycle on the equities market - 
recording a transition from the dominance of Technology 
stocks to that of Basic Materials stocks. 2-Spectral cen- 
trality shows roughly the opposite picture, demonstrat- 



ing how the first two eigenvalue/eigenvector pairs encode 
this transition. 3-Spectral centrality shows a complemen- 
tary picture, where a different sector (Healthcare) rises 
in importance in later windows. 

In contrast, the same picture for the other centralities 
miss the dynamic picture entirely. Degree and eigenvec- 
tor centralities show prominence only for the Basic Mate- 
rials sector and with a peak at a different time than that 
of spectral centrality. Betweenness again emphasizes Ba- 
sic Materials and scattered other equities of importants. 
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FIG. 4: 1, 2, and 3-spectral centralities of the S&P 500 

network. Size is proportional to centrality scores. Color also 
reflects spectral centrality, with red indicating higher scores 
and blue lower ones. 

Spectral centralities also help us determine the rele- 
vance of the results of spectral clustering. The color of 
the nodes in Figure [3] is given by cluster membership 
where we have run the spectral clustering algorithm for 
five clusters using two eigenvectors. In choosing the pa- 
rameters for spectral clustering - the number of clusters 
and number of eigenvectors - one is always faced with 
the consequences of making poor choices. Choosing too 
many clusters leads to a version of overfitting, vectoriza- 
tion, where natural geometric clusters can be artificially 
subdivided. In this figure, we can see how the combi- 
nation of edge and node spectral centrality sheds light 
on this issue. Qualitatively, we see that the blue, yellow 
and purple clusters are significant from the perspective 
of spectral methods while the white, black, and light blue 
clusters are more likely due to vectorization. The green 
cluster, which sits among the latter group, has relevance, 
containing nodes and edges with substantial 3-spectral 
centrality. This suggests that spectral centrality mea- 
sures may be used in conjunction with spectral clustering 
to optimize parameter choices. 

Immune System mediator network 

For our last example, we consider a dense weighted 
network describing the mediators in the human immune 
cell network [3D]. From data available in the literature 
[26], we construct a bipartite graph between 19 cell types 
of the Immune System (e.g. T and B lymphocytes, neu- 
trophyls, etc.) and 109 mediators of their mutual inter- 
action (such as cytokines, chemokines, etc.). This graph 
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FIG. 5: A comparison of spectral centrality and other 
centralities over windows of the S&P 500 data. 

is then collapsed to directed cell-cell interaction network. 
While the network is almost completely connected, the 
link weights - indicating the number of mediators for each 
pairs of cell types - carry the structure. 

In [30 an efficiency measure [IS] was used to character- 
ize the mediator relevance inside the network and quan- 
tify the relevance of each mediator. This was achieved 
by selective removal of the mediator from the original bi- 
partite graph, collapsing the network and comparing the 
efficiency of the resulting network to that of the original. 

For our application, we compute the 1-spectral cen- 
trality of the symmetrized cell-cell network, Co, via 
a deformation associated with each mediator. We let 
Bi = Co — Ci where Ci is the symmetrized cell-cell net- 
work after the i*^ mediator has been removed from the 
bipartite network and the bipartite network has been col- 
lapsed. We then compute the 1-spectral centrality using 
equation (|2|. 
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FIG. 6: Comparison between Efficiency and 1-Spectral 
Centrality for a Immune System mediator network. 

Some of the mediators receive a high ranking from both 
centrality measures (e.g. TGF-/3, a fundamental anti- 
inflammatory citokyne) except for a group of mediators 
on the top left of the figure (CSF-1, SDF-1, Thrombopoi- 



etin and IL-7) which have high 1-spectral centrality but 
low Efficiency. In the previous analysis, top-ranking me- 
diators were related to innate response in the Immune 
system, an ancient and highly conserved mechanism that 
involves among the oldest elements of the mediator net- 
work. The efficiency centrality measure thus seems to 
have selected the mediators that reflect the Immune Sys- 
tem evolution, in which the oldest elements should be also 
the most connected ones (as it happens in a preferential 
attachment-like model of network growth) . Interestingly, 
the most central mediators obtained by 1-SC have a com- 
pletely different role, since they are known from literature 
to be jointly involved in cell maturation and differenti- 
ation [8], a mechanism known as hematopoiesis, funda- 
mental for the maintenance of the whole Immune System. 
Even if we have a large overlap with the previous analysis 
(the two centrality measures look nonlinearly correlated) , 
the information about mediator relevance from the two 
measures is not the same. The relevance of the mediators 
in the network as attributed by 1-SC seems to be less de- 
pendent on network evolution issues, and more related to 
biological and functional aspects. 

This example demonstrates the use of general set de- 
formations for spectral centrality measurements. In this 
case, there was a natural deformation associated with 
each mediator. However, because the 1-spectral central- 
ity is related to a subset of the network determined by 
an underlying bipartite network, it is inappropriate to 
compare them to other centrality measures of Cq. 



IV. DISCUSSION 

We introduce a new notion of centrality for graphs, 
fc-spectral centrality, based on measuring the effects of 
deformations (i.e. node, edge or network subset removal) 
of the graph Laplacian on its eigenvalues. 

The fc-spectral centralities reveal information concern- 
ing the importance of different parts of the graph with 
respect to its geometry. For example, 1-spectral cen- 
trality identifies features relevant to the algebraic con- 
nectivity. Higher spectral centralities yield information 
relevant to the spectral embedding of the graph as well 
as to spectral clustering. We see that for small, sparse, 
unweighted, symmetric networks, spectral centrality be- 
haves similarly to other standard centralities. However, 
for larger, denser, weighted symmetric networks, spectral 
centrality provides a distinct, new, and potentially useful 
tool in network analysis. 

We see an initial setting in which the /c-spectral cen- 
tralities are appropriate - consensus formation. For net- 
works which reflect the substrate on which opinion propa- 
gates and consensus does or does not form, the /c-spectral 
centralities provide relevant summary statistics of impor- 
tance. Basic models of consensus formation posit opin- 
ion evolution via dynamics governed by the graph Lapla- 
cian. Thus, by construction, the fc-spectral centralities 
measure importance with respect to those opinion dy- 
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namics. Our first two dense weighted network examples 
can be interpreted through the lens of consensus forma- 
tion. In each case the behavior of the nodes is correlated 
(at different strengths) with that of other nodes. Back- 
ground opinion formation dynamics - opinions about bills 
in the U. S. House of Representatives and opinions of 
traders about the similarities of different equities - create 
a link between the correlation and consensus dynamics. 
In the example of the U.S. House of Representatives, if 
we assume that representatives are influenced by other 
representatives with strengths associated with their so- 
cial/professional ties, this network is then well modeled 
by the consensus formation dynamics. For the equities 
market, we assume a hidden network of traders who form 
opinions about the equities they trade. The opinions 
translate into close prices for equities as the traders buy 
and sell their various holdings. Thus, the time series of 
close prices is a reflection of the consensus dynamics of 
the traders and exhibits its own dynamics as a proxy 
for the aggregated dynamics of the traders. The equities 
network we construct is therefore a proxy for a consensus 
network of unseen actors. 

In this context, we see that the collection of /c-spectral 
centralities deliver better measures of centrality than ex- 
isting measures. In the roll call network, the 1-spectral 
centrality reflects our common notion of ideological cen- 
trality better than all other measures except random walk 
centrality. But, the lack of information in the higher cen- 
tralities confirms that the information in the 1-spectral 
centrality is essentially complete. In contrast, the first 
three spectral centralities for the S&P 500 network all 
carry significant information which show three different 
groups of important nodes and identifies the scale at 
which they are important. In our experiment with cen- 
trality measures over a windowed version of the S&P 500 
network, we see further evidence that the spectral cen- 



tralities outperform other measures in this application. 
Indeed, the 1-spectral centrality is the only measure to 
capture the evolution of importance and centrality as the 
network changes over time. 

The third example of a dense weighted network, the 
Immune System network, provides an example where it 
has already been observed that common centrality mea- 
sures fail to provide a clear ranking of mediator relevance 
[5U] , efficiency was therefore used to extract a measure of 
relevance for mediators in this network. 1-spectral cen- 
trality also provides a measurement of relevance for me- 
diators, but it provides a different ranking. While some 
mediators involved in anti-inflammatory responses had 
both high efficiency and 1-spectral centrality, the major- 
ity of mediators with highest 1-spectral centrality also 
had low efficiency. These mediators are involved in cell 
maturation and differentiation (hematopoiesis) while the 
top-ranking mediators by efficiency are more related to 
the biological process of innate response of the Immune 
System. While our centrality measure highlights media- 
tors related to fundamental system functions (its main- 
tenance by means of cell maturation and differentiation) , 
efficiency ranking in this case seems to depend on the pro- 
cess of network formation, for which the oldest network 
elements are also the most central. This example pro- 
vides evidence that 1-spectral centrality is a useful tool 
in identifying functional structure in biological networks 
which complements other existing measures. 
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TABLE I: Correlations between centralities for un- 
weighted example networks 
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Correlations between various centralities for the unweighted 
example networks we consider. The rows and columns are 
labeled as follows: D = degree, B = Betweenness Centrality, 
E = Eigenvalue Centrality, RW = Random Walk Centrality, 
I = Information centrality and S = Spectral Centrality. 



TABLE II: Correlations between centralities for dense 
weighted example networks 
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Correlations between various centralities for the weighted 
example networks we consider. The rows and columns are 
labeled as follows: D = degree, B = Betweenness Centrality, 
E = Eigenvalue Centrality, RW = Random Walk Centrality, 
I = Information centrality and S = Spectral Centrality. 



